This Page Is Inserted by IFW Operations 
and is not a part of the Official Record 

BEST AVAILABLE IMAGES 



Defective images within this document are accurate representations of 
the original documents submitted by the applicant. 

Defects in the images may include (but are not limited to): 



BLACK BORDERS 

TEXT CUT OFF AT TOP, BOTTOM OR SIDES 
FADED TEXT 
ILLEGIBLE TEXT 
SKEWED/SLANTED IMAGES 
COLORED PHOTOS 

BLACK OR VERY BLACK AND WHITE DARK PHOTOS 
GRAY SCALE DOCUMENTS 



IMAGES ARE BEST AVAILABLE COPY. 



As rescanning documents will not correct images, 
please do not report the images to the 
Image Problem Mailbox. 



- (12) INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(19) W rid Intellectual Property Organization 

International Bureau 

(43) International Publication Date 
7 September 2001 (07.09.2001) 




PCT 



(10) lnternati nal Publication Number 

WO 01/65424 A2 



(51) International Patent Classification 7 : G06F 17/50 

(21) International Application Number: PCT/US0 1/066 10 

(22) International Filing Date: 28 February 2001 (28.02.2001) 

(25) Filing Language: English 

(26) Publication Language: English 



(30) Priority Data: 

60/185,826 
09/797,386 



29 February 2000 (29.02.2000) US 
28 February 2001 (28.02.2001) US 



(71) Applicant: CADENCE DESIGN SYSTEMS, INC. 
[US/US]; 2655 Seely Avenue, San Jose, CA 95 1 34 (US). 

(72) Inventor: ARSINTESCU, Bogdan; 335 Elan Village 
Lane, ApL 316, San Jose, CA 95134 (US). 



(74) Agents: SMOOT, Roland, N. et al.; Lyon & Lyon LLP, 
633 West Fifth Street, Suite 4700, Us Angeles, C A 9007 1 - 
2066 (US). 

(81) Designated States (national): AE, AG, AL, AM, AT, AU, 
AZ, BA, BB, BG, BR, BY, BZ, CA, CH, CN, CO, CR, CU, 
CZ, DE, DK, DM, DZ, EE, ES, FI, GB, GD, GE, GH, GM, 
HR, HU, ID, EL, IN, IS, JP, KE, KG, KP, KR, KZ, LC, LK, 
LR, LS, LT, LU, LV, MA, MD, MG, MK, MN, MW, MX, 
MZ, NO, NZ, PL, PT, RO, RU, SD, SE, SG, SI, SK, SL, 
TJ, TM, TR, TT, TZ, UA, UG, UZ. VN, YU, ZA, ZW. 

(84) Designated States (regional): ARIPO patent (GH, GM, 
KE, LS, MW, MZ, SD, SL, SZ, TZ, UG, ZW), Eurasian 
patent (AM, AZ, BY, KG, KZ, MD, RU, TJ, TM), European 
patent (AT, BE, CH, CY, DE, DK, ES, FI, FR, GB, GR, IE, 
IT, LU, MC, NL, PT, SE, TR), OAPI patent (BF, BJ, CF, 
CG, CI, CM, GA, GN, GW, ML, MR, NE, SN, TD, TG). 



[Continued on next page] 



= (54) Title: DEVICE LEVEL LAYOUT OPTIMIZATION IN ELECTRONIC DESIGN AUTOMATION 



< 
if) 



O 



600 

602 



L ! l '' 1 -l v r ' i- r ^l 




(57) Abstract: According to a custom physical design 
process for integrated circuits, a method is provided for 
creating layouts characterized by optimal-length chains 
for different types of MOS circuit designs, including 
mixed-signal MOS designs. A chaining engine having 
a device library and operating on a computer converts a 
circuit representation such as a netlist file into a layout file 
characterized by optimal-length chains. Such conversion 
may be accomplished in linear time. From a circuit 
representationi a bipartite graph is prepared. A starting node 
in the bipartite graph is selected according to enumerated 
Euler trail algorithm rules. A constraint greedy walk is 
conducted to generate layout chains, and is preferably 
repeated until the bipartite graph is exhausted of edges, at 
which point the resulting layouts are returned. A single 
optimal layout solution can be obtained without enumerating 
all the possible layout options, resulting in a considerable 
speed advantage over conventional techniques. The 
disclosed chaining methods may be used, for example, in 
both customizing layouts and in creating automated layouts 
when enhanced with optional partitioning and/or folding 
capabilities. 
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S P E C I F LC A HO N 
TITLE OF THE INVENTION 

DEVICE LEVEL LAYOUT OPTIMIZATION IN ELECTRONIC DESIGN 

AUTOMATION 

CROSS-REFERENCE TO RELATED APPLICATION 
[0001] This application claims benefit of U.S. Provisional Application Serial 
No. 60/185,826, filed on February 29, 2000, hereby incorporated by reference as 
if set forth fully herein. 

BACKGROUND OF THE INVENTION 

1 . Field of the Invention 

[0002J The field of the invention generally relates to electronic design 
automation for integrated circuits and, more particularly, to systems and methods 
for optimizing the physical layout of elements in integrated circuit designs. 

2. Background 

[0003] Chip designers often use electronic design automation (EDA) 
software tools to assist in the design process, and to allow simulation of a chip 
design prior to prototyping or production. Chip design using EDA software tools 
generally involves an iterative process whereby the chip design is gradually 
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perfected. Typically, the chip designer builds up a circuit by inputting information 
at a computer workstation generally having high quality graphics capability so as 
to display portions of the circuit design as needed. A top-down design 
methodology is commonly employed using hardware description languages 
(HDLs), such as Verilog® or VHDL, for example, by which the designer creates an 

I integrated circuit by hierarchically defining functional components of the circuit, 

i 

and then decomposing each component into smaller and smaller components. 
[0004] Two of the primary types of components used in integrated circuits 
are datapaths and control logic. Control logic, typically random logic, is used to 

10 control the operations of datapaths. Datapath areas of the circuit perform 
functional operations, such as mathematical or other operations. 
[0005] The various components of an integrated circuit are initially defined 
by their functional operations and relevant inputs and outputs. The designer may 
also provide basic organizational information about the placement of components 

15 in the circuit using floorplanning tools. During these design states, the designer 
generally structures the circuit using considerable hierarchical information, and 
has typically provided substantial regularity in the design. 
[0006] From the HDL or other high level description, the actual logic cell 
implementation may be determined by logic synthesis, which converts the 
20 functional description of the circuit into a specific circuit implementation. The logic 
cells are then "placed" (i.e., given specific coordinate locations in the circuit layout) 
and "routed" (i.e., wired or connected together according to the designer's circuit 
definitions). The placement and routing software routines generally accept as 
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their input a flattened netlist that has been generated by the logic synthesis 
process. This flattened netlist identifies the specific logic cell instances from a 
target standard cell library, and describes the specific cell-to-cell connectivity. By 
application of a physical design process, the logic cells of the netlist file are placed 
and routed, resulting in a layout file. 

[0007] However, for some HDL modules (for example, analog modules or 
high performance custom digital designs), logic synthesis may be impossible 
because of the lack of specific logic cells in the library. In these situations, from 
the HDL or other high level description, the actual circuit implementation is 
typically determined by custom layout design, which converts a netlist associated 
with the circuit design into a specific circuit layout implementation. The layout 
modules are then placed and routed. In this situation, the placement and routing 
software routines generally accept as their input a flattened netlist that has been 
generated by the custom layout process. This flattened netlist identifies the 
specific layout modules from a target device library, and describes the specific 
device-to-device connectivity. By application of a physical design process, the 
layout modules created from the netlist file are placed and routed, resulting in a 
production-ready layout file. 

[0008] Device level layout editing for custom integrated circuit block design 
is a time-intensive task for present-day mixedrsignal circuit designs. Mixed-signal 
designs may include combinations of CMOS, domino MOS logic, analog, and/or 
other MOS topologies. Custom layout techniques make the task of device level 
layout editing for mixed-signal designs a major bottleneck in the entire design 
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process, due in large part to the specialized features required to optimize the 
design. 

[0009] One aspect of device level layout is arriving at a physical ordering of 
devices. The transistors of a circuit can often be ordered in different 
5 configurations without changing their logical function. Properly ordering devices in 
a circuit can increase the circuit's speed and reduce its layout area. 
[0010] Module editing refers to a complex of layout editing and generation 
features that yield optimized layout for a set of devices and cells. The focus of 
module editing generally is device-level rather than shape-level. For MOS device 
level layout, module optimization methods comprise mainly device abutment and 
splitting. A primary optimization goal is to minimize layout area and unwanted 
interconnection parasitics. Minimizing area means that a maximum number of 
abutments should be created between the devices in the cell. 
[0011] According to one well-known device layout technique, for a circuit 
design in which CMOS elements are present the transistors are placed in two 
parallel rows: the P-type transistors in one row, and the N-type transistors in 
another row. Power rails are routed along the two rows on the outside, and 
intracell routing is run between the rows. To minimize layout area, a primary goal 
is to place transistors in such a way that gate signals are aligned and the 
drain/source diffusions of adjacent transistors are abutted to the maximum extent 
possible, thereby reducing total diffusion area and minimizing the number of 
separations between diffusion strips. 
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[0012] While several algorithmic solutions have been proposed to optimize 
MOS device level layout, these solutions have limitations that restrict their utility. 
Prior solutions have generally been targeted toward particular circuit topologies, 
such as CMOS standard cells on the one hand, or specific analog circuits on the 

f 

other. Furthermore, several of these solutions also suffer from inefficiencies in 
optimization speed. 

[0013] It would therefore be advantageous to provide an improved circuit 
layout technique that results in rapid generation of optimal length chains and is 
applicable to any MOS circuit topology, including mixed-signal designs. 

SUMMARY OF THE INVENTION 
[0014] The present invention is directed in one or more aspects to methods 
for obtaining optimal layouts for integrated circuit design according to a custom 
physical design process. 

[0015] In a first aspect of the invention, a method is provided for creating 
layouts characterized by optimal-length chains for different types of MOS circuit 
designs, including mixed-signal MOS designs. In a second separate aspect of the 
invention, a chaining engine having a device library and operating on a computer 
converts a circuit representation such as a netlist file into a layout file 
characterized by optimal-length chains, and such conversion may be 
accomplished in linear time. From a circuit representation, a bipartite graph (such 
as a graph having properties of an Euler graph) is prepared. A starting node in 
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the bipartite graph is selected according to enumerated Euler trail algorithm rules. 
A constraint greedy walk is conducted to generate layout chains, and is preferably 
repeated until the bipartite graph is exhausted of edges, at which point the 
resulting layouts are returned. Advantageously, a single optimal layout solution 
can be obtained without enumerating all the possible layout options, resulting in a 
considerable speed advantage over conventional techniques. The disclosed 
chaining methods may be used, for example, in both customizing layouts and in 
creating automated layouts when enhanced with optional partitioning and/or 
folding capabilities. 

[0016] In a third separate aspect of the invention, any of the foregoing 
aspects are contemplated in combination for additional advantage. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0017] FIG. 1 is a diagram of a computer system that may be used in 
connection with various embodiments of the invention as described herein. 
[0018] FIG. 2 is a diagram of a simplified integrated circuit as may be 
generated using a computer system such as shown in FIG. 1 or according to the 
process set forth in FIG. 3. 

[0019] FIG. 3 is a diagram of a general process flow for a circuit design, 
illustrating various levels of circuit abstraction. 

[0020] FIG. 4 is a process flow diagram for custom physical design of an 
integrated circuit. 

[0021] FIG. 5a is a schematic of a simple CMOS circuit. 
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[0022] FIG. 5b illustrates two possible layouts for the transistor devices 
depicted in FIG. 5a. 

[0023] FIG. 6a is an alternate schematic of the simple CMOS circuit 
depicted in FIG. 5a. 

5 [0024] FIG. 6b is a bipartite graph representation of the CMOS circuit 
illustrated in FIG. 6a. 

[0025] FIG. 7 is a bipartite graph for a mixed-signal MOS circuit having both 
complementary and non-complementary portions. 
[0026] FIG. 8a is a netlist of a second CMOS circuit. 
1 0 [0027] FIG. 8b is a bipartite graph representation of the circuit depicted in 
FIG. 8a. 

[0028] FIG. 8c is a possible layout for the circuit depicted in FIG. 8a. 
[0029] FIG. 9a is a circuit diagram. 

[0030] FIG. 9b is a bipartite graph representation of the circuit of FIG. 9a. 
1 5 [0031] FIG. 10 is a flow diagram for a method for creating layouts for MOS 
circuit designs, including mixed-signal MOS designs, characterized by optimal- 
length chains. 

[0032] FIG. 11 is a flow diagram for the step of selecting a starting node 
depicted in FIG. 10. 

20 [0033] FIGS. 12a-i are bipartite graphs illustrating an application of the 
constraint greedy walk portion of the chaining method depicted in FIGS. 10-11 to 
the circuit depicted in FIGS. 5a and 6a-b. 

[0034] FIGS. 13a-e and 13g-j are bipartite graphs identical to FIGS. 12a-i. 
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[0035] FIG. 13f is a partial layout of the circuit depicted in FIGS. 5a and 6a 

obtained with the chaining method illustrated in FIGS. 10-11. 

[0036] FIG. 13k is a complete layout of the circuit depicted in FIGS. 5a and 

6a obtained with the chaining method illustrated in FIGS. 10-11 . 

[0037] FIGS. 14a-i are bipartite graphs illustrating a first alternative 

application of the constraint greedy walk portion of the chaining method depicted 

in FIGS. 10-1 1 to the circuit depicted in FIGS. 5a and 6a-b. 

[0038] FIGS. 15a-e and 15g-j are bipartite graphs identical to FIGS. 14a-i. 

[0039] FIG. 15f is a partial alternative layout of the circuit depicted in. FIGS. 

5a and 6a obtained with the chaining method illustrated in FIGS. 10-11. 

[0040] FIG. 15k is a complete alternative layout of the circuit depicted in 

FIGS. 5a and 6a obtained with the chaining method illustrated in FIGS. 10-11. 

[0041] FIG. 1 6a is a circuit schematic for a third CMOS circuit. 

[0042] FIGS. 16b-c illustrate possible layouts for the circuit depicted in FIG. 

16a. 

[0043] FIG. 17a is a layout for a fourth circuit having uniform height bounds. 
[0044] FIG. 17b is a layout for the circuit represented in FIG. 17a, but 
without uniform height bounds. 

[0045] FIG. 18a illustrates two circuits of different topologies. 

[0046] FIG. 18b is a layout for the circuits provided in FIG. 18a following a 

folding step. 

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS 
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[0047] FIG. 1 is a diagram of a computer system that may be used in 
connection with various embodiments of the invention as described herein. As 
shown in FIG. 1, a computer system 100 includes a computer 110 connected to a 
display 191 and various input-output devices 192. The computer 110 may 
comprise one or more processors (not shown), as well as working memory (e.g., 
RAM) in an amount sufficient to satisfy the speed and processing requirements of 
the system. The computer 110 may comprise, for example, a SPARC™ 
workstation commercially available from Sun Computers, Inc. of Santa Clara, 
California, or any other suitable computer. 

[0048] The computer 110 contains stored program code including, in one 
embodiment, a shape layout editor 120, a connectivity-driven layout editor 130, 
and custom design placement and routing tools 140, all as part of a custom 
design flow 115. The shape layout editor 120 provides for the creation of layouts 
at polygon level. One example of such an editor is Virtuoso® Layout Editor, 
commercially available from Cadence Design Systems, Inc., of San Jose, 
California. The connectivity-driven layout editor 130 in FIG. 1 provides for the 
construction of correct-by-construction layouts, having as input a circuit 
description such as a netlist. One example of this editor is Virtuoso® XL Layout 
Editor details of which may be found, for example, in the Virtuoso XL Layout 
Editor User Manual, release number 4.4.5, commercially available from Cadence 
Design Systems, Inc. The custom design placement and routing tools 140 
perform placement and routing of custom layout modules according to specific 
custom constraints inferred from the circuit description or defined by the user. 
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Commercially available examples of custom design placement and routing tools 
are Virtuoso® Custom Placer and Virtuoso® Custom Router, also commercially 
available from Cadence Design Systems, Inc. Preferably, the components of the 
custom design flow 1 1 5 are dynamically integrated. 

[0049] In support of the above-mentioned custom design flow components, 
standard synthesis, placement, and routing flow tools 145 may be usefully 
employed. Such tools 145 may include a chip floorplanner 150, global/detail 
router 160, standard cell placer 170, logic synthesizer 180, and HDL editor 190. 
Operation of the chip floorplanner 150, global/detail router 160, standard cell 
placer 170, logic synthesizer 180, and HDL editor 190 is conventional, as the 
design of these components is well known in the art of electronic design 
automation. Commercially available examples of these system components are 
Preview™, Cell3™, QPIace™, Synergy™, and Verilog®, respectively. 
[0050] The computer 110 is preferably coupled to a mass storage device 
(e.g., magnetic disk or cartridge storage) providing a layout database 195 with 
which the foregoing system components interface. The layout database 195 may 
be implemented using the EDIF database standard. The computer 110 may also 
comprise or be connected to mass storage containing one or more component 
libraries (not shown) specifying features of electrical components available for use 
in circuit designs. 

[0051] Referring now to FIG. 2, there is shown an illustration of a simplified 
integrated circuit 200 that may be represented by circuit design data stored in the 
layout database 195. In actual, more realistic integrated circuit designs, the 
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integrated circuit 200 would be far more complicated. However, FIG. 2 is useful 
for purposes of illustration. As shown therein, the integrated circuit 200 comprises 
of a plurality of control regions 201, datapath regions 203, and memory 205. The 
various control regions 201, datapath regions 203 and memory 205 are 
5 interconnected with databuses 207 generally spanning multiple bits. Each 
datapath region 203 may comprise a plurality of datapath functions 209. A 
datapath function 209 may utilize some or all of the bits available from the databus 
207. A datapath function 209 may comprise a plurality of cell instances 215 which 
enable some form of signal or logic transformation of the data passed by the 

10 databus 207. The cell instance 215 within a datapath function 209 generally 
operates on the data carried on the datapath function 209. Custom layout design 
generally applies to control logic 201, memory 205, databus 207, and analog 
block 210, for which specific instances do not exist in the library and custom 
device-level layouts must be created. 

15 [0052] As represented in the schema of the layout database 195, the 
integrated circuit 200 is comprised of a plurality of instances and a plurality of 
nets. A net interconnects a number of instances, by associating pins on each of 
the instances or, more generally, by associating the inputs and outputs of a 
number of instances. 

20 [0053] FIG. 3 is a diagram of a general process flow for a circuit design, 
illustrating some of the various levels of circuit abstraction as described above. 
As illustrated in FIG. 3, a register transfer logic (RTL) file 301 in the form of an 
HDL file or other high level functional description undergoes a compile process 
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303, which typically includes some form of logic synthesis, and converts the 
functional description of the circuit into a specific circuit implementation which may 
be stored in the form of a netlist file 304. As part of the compile process 303, a 
component and device library 306 is generally referenced, which stores 
information concerning what types of design components and devices are 
available, and the characteristics of those design components and devices which 
are needed in order to determine their functional connectivity. For custom design, 
the netlist file 304, as previously noted, generally identifies the specific devices 
from a device-level library, and describes the specific device-to-device 
connectivity. 

[0054] By application of a custom physical design process 309 shown in 
FIG. 3, the devices of the netlist file 304 are instantiated, layout modules are 
generated, and these modules are then placed and routed, resulting in a layout 
file 310. The device library 306 is utilized in this process stage in order to obtain 
information concerning the sizes of the devices and other components that may 
be present in the netlist file 304. 

[0055] From the layout file 310, a verification process 312 may be run, as 
further illustrated in FIG. 3, resulting in a mask file 315 in, for example, a GDSII or 
CIF format. The mask file 315 may be provided to a foundry/and contains 
enough information to allow the foundry to manufacture an actual integrated circuit 
therefrom. 

[0056] Various embodiments as described herein relate to the physical 
design process 309 depicted in FIG. 3, and more specifically to the task of device 
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level layout optimization to arrive at a particular layout. In accordance with a 
preferred embodiment as disclosed herein, an automated custom physical design 
(APCD) flow is used to facilitate the rapid design of custom layout blocks. FIG. 4 
depicts a basic strategy for a custom physical module generation process 400. As 
illustrated in FIG. 4, an input application procedural interface (API) 402 provides a 
circuit representation, such as a netlist file, to a chaining engine 406. Chaining in 
that context generally refers to the process of linking circuit elements according to 
instructions in a circuit representation. The output of a chaining engine 406 is 
provided to an output API 410, typically resulting in a layout file. Preferably, the 
custom physical module generation process 400 is augmented with additional 
steps, such as a partitioning step 404 and a folding step 408. The partitioning 
step 404 involves data preparation that reduces the size of the input to the 
chaining engine 406. Partitioning is typically performed based on the original 
circuit hierarchy and user specifications. A reduced input size, having fewer 
elements, ensures that shorter and more placement-useful chains will be 
obtained. The folding step 408 preferably involves data finishing performed after 
chaining is complete; more specifically, folding splits a chained circuit into multiple 
parallel devices to address user-defined width constraints. The physical design 
strategy depicted in FIG. 4 is utilized, for example, in the Virtuoso ® XL Layout 
Editor referred to previously. 

[0057] The process illustrated in FIG. 4 preferably arrives at an optimal 
layout, having a minimal area and short wire lengths. Arriving at an optimal layout 
can be quite challenging. Even a simple circuit design typically can be 
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implemented in a multiplicity of possible layouts. For example, FIG. 5a depicts a 
simple CMOS cell schematic 420, while FIG. 5b depicts two possible layouts 424. 
426 in which the schematic 420 can be implemented. As circuit complexity and 
the number of circuit elements increases, the number of possible layouts grows 
dramatically. 

[0058] One way of modeling the possible abutments between transistors in 
a circuit is through a bipartite graph. A bipartite graph may be used as a modeling 
tool in creating a layout, such as the layouts depicted in FIG. 5b. For example, 
FIG. 6a provides a schematic for a simple CMOS circuit 440 including multiple 
transistors, and FIG. 6b provides its bipartite graph representation. A bipartite 
graph in one aspect represents the possible abutments between pairs of 
transistors. Each node in the bipartite graph 442 corresponds to a set of pairwise 
abuttable P-type or N-type transistors, that is, transistors sharing the same P-type 
or N-type drain/source diffusion net (connection) in the circuit 440. In the graph 
442, the "left" nodes 444 correspond to the nets to which PMOS transistors are 
connected with either the source and drain terminal. Similarly, the "right" nodes 
446 correspond to the NMOS transistors. The edges 448 correspond to a set of 
NMOS and PMOS pairs that share the same gate connection and the same pair 
of nodes. An edge exists between two nodes if and only if an abutment is 
possible between two pairs which are composed of the transistors represented by 
these two nodes. A bipartite edge corresponds to a pair of transistors with a 
common gate connection that can abut on the nets corresponding to the nodes it 
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connects. A bipartite edge in this context always connects a "right" node to a "left" 
node. In FIG. 6b, all of the edges 448 shown are bipartite edges. 
[0059] FIG. 7 depicts an exemplary bipartite graph 50 for a mixed-signal 
MOS circuit, illustrating the presence of both complementary and non- 
5 complementary portions. Nodes 1-8 correspond to the complementary portion of 
the circuit, and include bipartite edges 1.7, 2.5, 2.6, 3.8, and 4.6. The graph 50 
further nodes 9-12 that correspond to a non-complementary portion of the circuit, 
which portion is characterized by non-bipartite edges 4.9, 10.1 1 , and 8.12. A non- 
bipartite edge corresponds to a transistor that does not have a complementary 

10 pair. Generally, a non-bipartite edge always connects nodes on the same side of 
a bipartite graph. The degree of a node reflects the number of bipartite edges 
corresponding to that node. For example, node 1 has only one corresponding 
bipartite edge 1.7, and therefore has a degree of one, whereas node 2 has two 
corresponding bipartite edges 2.5 and 2.6, and therefore has a degree of two. A 

1 5 node with an odd number of corresponding bipartite edges is said to have an odd 
degree, while a node with an even number of corresponding bipartite edges is 
said to have an even degree. Therefore, the terms "odd" and "even" as used 
herein in connection with nodes apply only to the number of bipartite edges 
corresponding to a node. A node having only non-bipartite edges would not be 
20 characterized as having a degree, whether odd or even. 

[0060] A further example showing a bipartite graph in context as a layout 
tool is provided in FIGS. 8a-8c. FIG. 8a illustrates a CMOS circuit and information 
relating to its netlist representation; FIG. 8b shows a bipartite graph structure of 
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that circuit; and FIG. 8c illustrates a resulting layout. In FIGS. 8a-8c, left node 
labeled "out" has the NMOS devices N1 and N5 connected, and right net labeled 
"C" has the PMOS devices P0 and P1 connected. An edge exists between "out" 
and "C" because net "G1" and "G2" connect the pairs (N1, PI) and (N5, P0) 
',; respectively. The edge completes the abutments N1, N5 and P1, P0 on the nets 
"out" and "C" respectively, with the gate connections G1 and G2 aligned. 
[0061] Since it represents all possible abutments between pairs of 
transistors in a circuit, a bipartite graph typically has redundant information. A 
concept called mutual exclusivity assists in discovering redundant information, 

10 and permitting a greedy walk (discussed below) to continue through only one of 
the redundant paths. A bipartite graph may include "mutually exclusive edges" 
and "essential edges." Two possible abutments are referred to as "mutually 
exclusive" if at most one of them can be in any solution. Otherwise, they are 
referred to as "compatible." In the bipartite graph, two edges that represent two 

15 mutually exclusive possible abutments, respectively, are "mutually exclusive 
edges." Restated, two edges that represent to a connection between nodes / and 
y, and respectively nodes /' and j\ corresponding to transistor gates k /, and 
respectively transistor gates W /' are mutually exclusive if either the first or second 
gate connections denoted by k I, k' F and the nodes to which these edges are 

20 connected are not common at either end. In FIG. 6b, wherein each node 448 is 
labeled with a letter "e" followed by subscripts and superscripts, the subscripts 
refer to the nodes (denoted as /, ; or /' y" above) and the superscripts refer to the 
transistor gates (denoted as k I or kf above). 
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[0062] An "essential" abutment is an abutment that must appear in any 
solution derived from the current node. In the bipartite graph, an edge that 
represents an essential abutment is an "essential edge." In other words, an 
"essential edge" is an edge that must appear in any solution. 
5 [0063] To further illustrate the concepts of essential and mutually exclusive 
edges, attention is directed to FIGS. 9a and 9b. FIG. 9a is a circuit diagram with 
labeled nodes. In FIG. 9a, the abutment between the N5 transistors e and f is 
through the GND connection in any optimal solution. For this reason, the node 
GND and the node N1, and the edge connecting them, are essential in any 

10 optimal solution. FIG. 9b is a bipartite graph for the circuit provided in FIG. 9a. 
The letters accompanying each edge refer to the transistor gates in FIG. 9a 
(denoted as k lor k'l' above, or as superscripts in FIG. 6b). To avoid cluttering the 
illustration, the edges do not include separate labels referring the nodes (denoted 
as /,yor /'/above, or as subscripts in FIG. 6b); however, as each node is labeled 

1 5 separately, the starting and ending nodes for each edge is evident. 

[0064] With regard to mutual exclusivity, FIG. 9b has five sets of mutually 
exclusive edges: (1) edge ab between nodes VDD and N2, and edge ab between 
nodes N2 and O; (2) edge be between nodes VDD and N3, and edge be between 
N3 and O; (3) edge cd between nodes VDD and N4, and edge cd between nodes 

20 N4 and O; (4) edge de between nodes VDD and N5, and edge df between nodes 
N5 and O; and (5) edge ef between nodes N5 and N1, and edge ef between 
nodes N1 and GND. Since edge ef connecting nodes N1 and GND was identified 
as an essential edge which must be included in any optimal solution, the edge ef 
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connecting nodes N1 and N5 cannot be included in an optimal solution since it is 
in a mutually exclusive set with the edge ef connecting nodes N1 and GND. 
[0065] Having introduced the bipartite graph model, focus will now be 
directed on the chaining step of the physical layout process, deferring a more 
detailed discussion of optionally associated partitioning and folding steps until 
later. A method for creating layouts characterized by optimal-length chains for 
any type of MOS circuit designs, including mixed-signal MOS designs, is provided 
in FIG. 10. Preferably, the method depicted in FIG. 10 is embedded in a software 
routine that may be operated on a computer system, such as the computer 
system 100 depicted in FIG. 1. A first step 600 in the method is providing a circuit 
representation, which may be a netlist or other representation, as an input. 
Initialization of variables, such as layout number, is a following step 602. A next 
step 604 is building a bipartite graph (a simple example of which is shown in FIG. 
6b) from the circuit representation provided as an input. The bipartite graph 
identifies complementary portions of a circuit, irrespective of its size and topology. 
When preparing the bipartite graph, "essential edges" and "mutually exclusive 
edges" should be identified, as this identification information will be considered in 
a subsequent step. This identification may be performed by computation, and is 
automated. One method of implementing this identification is to annotate the 
bipartite graph with the presence of essential and mutually exclusive edges. The 
next step 606 involves examination of the bipartite graph to determine whether it 
contains any edges. 
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[0066] So long as the bipartite graph is not empty, the method proceeds to 
the step of selecting a starting node 608 for initiating a constraint greedy walk 625 
to generate a chain. Attention is directed to FIG. 11 for further details on the step 
of selecting a starting node 608. The bipartite graph preferably has the same 
properties as an Euler graph; an optimal length walk thus preferably starts 
according to the following Euler trial algorithm rules, applied in descending order: 

(a) According to a first decision block 610, if there is present in the 
bipartite graph a node with an odd degree, which node also has a 
bipartite edge traveling to another node of odd degree, then such 
node with the lowest odd degree is selected as the starting node for 
the greedy walk, consistent with block 612. 

(b) If the first condition is not satisfied, then according to a second 
decision block 614, if there is present in the bipartite graph any node 
with an odd degree, then such node with the lowest odd degree is 
selected as the starting node for the greedy walk consistent with 
block 616. Inherently from the Euler trail properties, this implies that 
the entire greedy walk will end at a node with an odd degree 

(c) If neither of the first two conditions are satisfied, then according to a 
third decision block 618, the greedy walk may start at any other 
node having a bipartite edge, consistent with block 620. 
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(d) If none of the first three are satisfied (i.e., no nodes having bipartite 
edges are present), then the greedy walk may start at any node 
having at least one edge connected to it, consistent with block 622. 

[0067] The starting node obtained according to these rules generally 
guarantees finding an optimal solution. 

[0068] Referring back to FIG. 10, each greedy walk is preferably performed 
after creation of a new chain 624. The constraint greedy walk 625 may involve 
several sub-steps. According to one embodiment, a first sub-step 628 is 
determining whether an edge can be selected from the starting node. If the 
answer is affirmative and an edge can be selected, then a second inquiry 630 is 
performed to determine whether any bipartite edges are present in the bipartite 
graph 604. If any bipartite edges are present, then a bipartite edge is selected to 
continue the walk, selecting an essential edge first in preference, according to 
sub-step 632. Only if no bipartite edges are present will a non-bipartite edge be 
selected to continue the walk, and the walk will proceed thereafter only with non- 
bipartite edge until none are available, according to substep 633. Thereafter, the 
selected edge is added to the chain 624 in sub-step 636. Following addition of the 
edge to the chain is, at substep 638, deletion from the bipartite graph 604 of any 
edges mutually exclusive to the edge just selected. As noted above, mutual 
exclusivity identifies redundant sets of edges, and such sets were identified 
previously when the bipartite graph was generated . Yet it is not until one edge of 
a mutually exclusive set is selected that the other edges in that set will be subject 
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to deletion, for it is not until one edge is selected that the unnecessary edges are 
known. Deletion of the remaining edges in a mutually exclusive set permits the 
greedy walk to continue through only one of the redundant paths. 
[0069] The next sub-step 640 is setting the node as the other end of the 
current edge. The method then returns to the inquiry at sub-step 638, and the 
chain continues to grow in like fashion until no further edges can be selected from 
the node. At that point, the completed chain is returned from the greedy walk 625 
at sub-step 642. 

[0070] Following the constraint greedy walk 625 is creation of a layout 644 
using devices from the reference library 306, depicted in FIG. 3, for the new.chain 
624. Each layout 644 is stored in a memory, such as is present in the computer 
system 100 depicted in FIG. 1. Continuing to refer to FIG. 10, the next step 646 is 
deletion of the edges visited during the constraint greedy walk 625. These steps 
608-646 continue until the bipartite graph 604 is emptied of edges, at which point 
the resulting layouts 650 are returned. 

[0071] The constraint greedy walk is a modified Euler walk that yields 
chains for the NMOS and PMOS devices simultaneously. The complexity of the 
overall method is 9 (n x s), where n is the number of nets and s is the number of 
terminals per nets. For real-life circuits, the number of nets is proportional with the 
number of devices and the number of terminals per net is limited, hence the 
algorithm complexity for the present method is linear with the size of the circuit. 
Also, the bipartite graph of any MOS circuit can be constructed in linear time. 
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[0072] A chaining method in accordance with the general principle of the 
technique illustrated in FIGS. 10-11 may be restated in more generalized 
language, as follows. First the complementary portion of the circuit is identified, 
irrespective of size and topology, in a bipartite graph. Next, a single optimal 
solution for any complementary MOS sub-circuit is obtained, as a pair of NMOS 
and PMOS chains, utilizing a constraint greedy walk through the bipartite graph. 
In other words, the greedy walk step starts with the assumption that the circuit 
design includes CMOS elements. The greedy walk processes each 
complementary pair until none are left in the bipartite graph. When one walk is 
completed, that is, a NMOS and PMOS pair of chains is found, another walk is 
attempted for the remaining edges in the graph. Since each edge is visited only 
once, after all complementary pairs have been exhausted, then a simpler walk 
may be continued to process the remaining edges. The chaining method also 
yields placement constraints such as alignment and orientation. That is, the 
devices in the same chain are aligned along the abutment direction. Also, the 
chain pair is aligned such that the device gates are in vertical alignment. The 
solution obtained is optimal with respect to the number of abutments created per 
circuit and the number of gate alignments. 

[0073] FIGS. 12a-i and 13a-k illustrate an application of a preferred 
constraint greedy walk portion of the chaining method described in connection 
with FIGS. 10-11 to the circuit depicted in FIGS. 5a and 6a-b. FIG. 12a provides 
a bipartite graph illustrating the selection of the starting node v p 1 according to step 
608 of FIGS. 10-11, and further illustrates selection of a first edge (bolded) 
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according to steps 628-633. The first edge if added to the chain according to step 
636, and as there are no edges mutually exclusive with the first edge just 
selected, node v n 2 is set as the current edge according to step 640. 
[0074] FIG. 12b illustrates the selection of a second edge (bolded) 
according to the constraint greedy walk in step 628. The second edge is then 
added to the chain. FIG. 12c illustrates the deletion of edges mutually exclusive 
to the second edge, with the deleted edges shown in bold dashed lines, according 
to step 638. FIG. 12d illustrates the selection of a third edge (bolded) according 
to the constraint greedy walk. Since no further edges can be selected from the 
node v n 1 . the chain is returned and appended to the layout according to. steps 
642-644, and the visited edges are deleted in FIG. 12e according to step 646. 
Thereafter, a new starting node and first edge are selected in FIG. 12f, then a 
second edge is selected in FIG. 12g, and a third edge is selected in FIG. 12h. 
Since no further edges can be selected from the node v n 4 , the chain is returned 
and appended to the layout. The visited edges are deleted in FIG. 14i, and, since 
there exist no further edges in the bipartite graph, the layouts are returned at step 
650. 

[0075] FIGS. 13a-e are duplicative of FIGS. 12a-e, but FIG. 13f illustrates 
the point at which a layout is obtained and what the layout looks like. Note that 
the layout of FIG. 13f constitutes a portion of layout 426 of FIG. 5b. Thereafter, 
FIGS. 13g-j are duplicative of FIGS. 12f-i, but FIG. 13k illustrates what point at 
which an appended layout is obtained and what the final layout looks like. Note 
that the layout of FIG. 13k is the same as the layout 426 of FIG. 5b. 
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[0076] FIGS. 14a-i and 15a-k illustrate a second, alternative application of 
the constraint greedy walk portion of the chaining method described in connection 
with FIGS. 10-11 to the circuit depicted in FIGS. 5a and 6a-b. It is important to 
mention that FIGS. 14a-i and 15a-k are an alternative application of the chaining 

r 

method applied in connection with FIGS. 12a-i and 13a-k. There would be no 
reason to run both, as application either results in an optimal layout (evident by 
comparing the final layouts illustrated in FIGS. 13k and 15k). The point of 
providing FIGS. 14a-i and 15a-k is to show that the precise layout of the circuit 
depicted in FIGS. 5a and 6a cannot be predicted; instead, what can be predicted 
is the optimality of the resulting solution. FIG. 14b shows that it is possible to 
select a different second node, relative to the node selected in FIG. 12b. 
Following that selection, two mutually exclusive edges are deleted at FIG. 14c 
(shown in dashed lines), differing from the edges deleted in FIG. 12c. The 
chaining method runs its course according FIG. 10, similarly as described in 
connection with FIGS. 12a-i, until no further edges exist in the bipartite graph and 
a final layout is obtained. 

[0077] As before, FIGS. 15a-e are duplicative of FIGS. 14a-e, but FIG. 15f 
illustrates the point at which a layout is obtained and what the layout looks like. 
Thereafter, FIGS. 15g-j are duplicative of FIGS. 14f-i, but FIG. 15k illustrates 
what point at which an appended layout is obtained and what the final layout looks 
like. Note that the layout of FIG. 15f constitutes a portion of layout 424 of FIG. 5b, 
and that the final layout of FIG. 15k is the same as the layout 424 of FIG. 5b. 
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[0078] At least one novel aspect of the disclosed chaining method includes 
its ability to handle different types of MOS circuit topologies - including mixed- 
circuit topologies. A CMOS topology typically requires that a series circuit have a 
corresponding parallel circuit; therefore, an abutment between the two groups is 
guaranteed. A general topology does not guarantee this phenomenon, and the 
graph structure becomes more complex as a result. Without the carefully selected 
starting point at step 609 in FIG. 10, an optimal solution for such circuits cannot be 
obtained. Moreover, the disclosed chaining method includes an iterative 
capability at step 606 in FIG. 10. If a greedy walk happens upon an input that it 
cannot handle, it identifies and skips the problem area and continues chaining 
process until it fully consumes the graph, therefore generating a complete 
solution. Utilizing the disclosed method, any mixed-signal MOS circuit topology 
may be handled: any combination of MOS circuits, including CMOS, domino 
logic, and analog. 

[0079] Another novel aspect of various disclosed chaining methods relates 
to the speed with which they obtain an optimized layout. By observing that the 
bipartite graph trail is similar to an Euler graph trail algorithm (namely, when the 
graph has vertices with an odd number of edges, a maximal graph trail starts and 
ends in such a node), an optimal solution may be found in linear time. 
Furthermore, the trivial trail (an isolated edge in the graph, which yields a trail of 
one edge) is necessarily postponed, in order to attempt to build a longer trail first. 
Applying careful selection of the starting point, the method generally ensures that 
each transistor is visited only once during chaining process according to a 
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constraint-greedy walk through a bipartite graph. A single optimal solution is 
obtained without enumerating all the possible ones, resulting in a considerable 
speed advantage. 

[0080] The disclosed chaining methods may be used in both customizing 
layouts (user-driven module generation and editing) and creating automated 
layouts (automated module generation) when enhanced with optional partitioning 
and/or folding capabilities. Interactive module generation and editing permits a 
user, for a selected set of components, to specify folding parameters for each 
device, and then generate the layout for the selected circuit. Parameters can be 
adjusted until a satisfactory layout is obtained. Preferably, a standard "point-and- 
click" cross-selection mechanism between schematic and layout is employed, 
permitting hierarchical structures to be selected in the schematic and portions of 
the hierarchy to be de-selected in the layout if their generation is not desired. In 
FIG. 16b-d, a number of interactive chaining and folding scenarios are presented 
for the circuit in FIG. 16a. 

[0081] The layout in FIG. 16b shows the chain obtained for the entire circuit 
without requiring hierarchical partitioning. FIGS. 16c-d show two layouts obtained 
when the gates in the circuit have been manually partitioned vertically or 
horizontally respectively. Note that, in both layout examples, for each pair of 
NMOS and PMOS chains, the devices are ordered in such a way that all gates are 
aligned, hence gate routing can be completed in polysilicon. The devices are 
preferably abutted, requiring placement of contacts only where external 
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connections are necessary. Otherwise, device contacts are preferably not created 
for the internal nets of the chain in order to minimize layout area. 
[0082] For the top chains in FIG.16d, two possible folding scenarios are 
depicted in FIGS. 17a-b. Chain height bounds may be user-specified, such as in 
FIG. 17a, causing the layout will have a uniform height within these bounds and all 
the fingers of devices to have equal sizes. Alternatively, the number of fingers 
and the width of each device finger may be user defined, such as in FIG. 17b. 
This example shows the flexibility of the solution in using automatic chaining 
together with (a) automatic or (b) user defined folding. 

[0083] Beside interactive layout editing, a chaining engine according to the 
present invention, enhanced with known folding capabilities, can be used to 
generate a layout automatically, starting from a connectivity source (netlist or 
schematic). The same optimization methods described above are used to 
generate the full layout. However, due to the size of the problems, fully chained 
layouts (even for small circuits like in FIG. 16a) might not be optimal for placement 
due to the impractical aspect ratio of the module obtained. Hence, the engine 
needs to partition the circuit before generating the chains. 
[0084] A partitioning algorithm, part of the chaining engine as shown in FIG. 
4, may generate a solution according to the following constraints, listed in 
descending order of priority: 

1 . Preserve circuit hierarchy as specified by the user, chaining together 
devices within the same logical hierarchy. For a heterogeneous design 
hierarchy, a preferred solution performs a bottom-up hierarchical 
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partitioning. A user preferably specifies how many bottom-up hierarchical 
levels should be preserved by the partitioned For example, gate level 
clustering is one hierarchical level of partitioning. 

2. Scale the number of partitions linearly with the number of 
5 hierarchical partitions, independent of circuit size. The partitioning 

algorithm is preferably scalable linearly with the size of the circuit. 

3. Partition the circuit with a maximum partition size, in case 
hierarchical partitioning cannot be performed. This feature helps reducing 
the size of the chains generated to reasonable form-factors, such that 

10 automated placement can still handle them. 

[0085] Based on the above criteria, a 2 x n partitioning algorithm may be 
adapted to accommodate these conditions. This algorithm is generalized from one 
well known in the art (Paul Stravers, "Partitioning a network into n pieces with a 
time-efficient net cost function," Proceedings of Design Automation Conference, 

15 IEEE 1991, pp. 177-182, which is incorporated herein by reference), over a 
number of n cuts with a cost function whose complexity is log(fl). For a circuit of 
size p partitioned into n areas, the complexity of this algorithm has been 
demonstrated to be p x n log(n). Since the number of partitions n is fixed in our 
case for hierarchical circuits, the complexity of our implementation is linear with 

20 respect to circuit size p. 

[0086] In a preferred partitioning implementation, a cost function attempts 
to minimize wire length and to equalize area or the number of devices per 
partition. Moreover, when hierarchy information is inherited from the connectivity 
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source, the move space is reduced to the partitions in the same logical hierarchy. 
The partitioning algorithm works as follows. The bottom-up hierarchical 
partitioning is performed first, based on a user specification. For each partition in 
this step, optimized partitioning is performed in case the number of devices 
exceeds a size bound. After partitioning optimization, each partition is then 
submitted to the chaining module. 

[0087] A preferred folding algorithm parses a chain of devices and performs 
a two step procedure as follows: 

1. search for specific circuit topology patterns and apply a, custom 
folding schema, or 

2. fold devices individually in consecutive fingers in the chain using the 
device width bounds available. 

[0088] In a preferred implementation, all the user-specified folding is also 
preserved. The custom topology patterns include parallel devices and pseudo- 
parallel devices. FIGS. 18a-b show the folding strategy for various circuit 
topologies. Note that pseudo-parallel nets (resulting by splitting devices in FIG. 
18a) need not be routed when such a folded structure is created, because there is 
no current flow between the symmetrical branches of the circuit. The layout 
generated in the chaining engine is preferably annotated with this information. 
Given the interdigitated structure, such devices can be also folded with an even 
number of fingers. 

[0089] While preferred embodiments of the invention have been described 
herein, many variations are possible which remain within the concept and scope 
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of the invention. Such variations would become clear to one of ordinary skill in the 
art after inspection of the specification and the drawings. The invention therefore 
is not be to restricted except within the spirit and scope of any appended claims. 
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CLAIMS 

What is claimed is: 

[0090] 1 • A method for laying out a circuit design, comprising the steps 
of: 

obtaining a bipartite graph representing a circuit design in nodes and 

edges; and 

processing the bipartite graph to generate a layout file, such 
processing further including the steps of: 

selecting a starting point within the bipartite graph to ensure 
that each edge is visited only once; 

initiating a constraint greedy walk through the bipartite graph 
to identify paired NMOS and PMOS chains; and 

after the first walk is completed, initiating at least one 
additional walk through the bipartite graph for the remaining graph edges to 
optimize the remaining chains in the circuit. 

[0091] 2. The method of claim 1 , wherein the circuit is characterized by 
multiple topologies. 

[0092] 3. The method of claim 1, wherein the processing steps are 
completed in linear time relative to the complexity of the circuit. 
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